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Abstract 

We prove that, in the Gilbert model for a random geometric graph, almost 
every graph becomes Hamiltonian exactly when it first becomes 2-connected. 
This proves a conjecture of Penrose. 

We also show that in the /c-nearest neighbour model, there is a constant k 
such that almost every K-connectcd graph has a Hamilton cycle. 

1 Introduction 

In this paper we mainly consider one of the frequently studied models for random 
geometric graphs, namely the Gilbert Model. Suppose that S n is a ^/n x yjn box 
and that V is a Poisson process in it with density 1. The points of the process form 
the vertex set of our graph. There is a parameter r governing the edges: two points 
are joined if their (Euclidean) distance is at most r. 

Having formed this graph we can ask whether it has any of the standard graph 
properties, such as connectedness. As usual, we shall only consider these for large 
values of n. More formally, we say that G = G n ^ r has a property with high probability 
(abbreviated to whp) if the probability that G has this property tends to one as n 
tends to infinity. 
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Penrose jlOj proved that the threshold for connectivity is irr 2 = logn. In fact he 
proved the following very sharp result: suppose 7rr 2 = logn + a for some constant 
a. Then the probability that G n r is connected tends to e e 

He also generalised this result to find the threshold for K-connectivity: namely 
irr 2 = log n+(«— 1) log logn. Moreover, he found the "obstruction" to ^-connectivity. 
Suppose we fix the vertex set (i.e., the point set in S n ) and "grow" r. This gradually 
adds edges to the graph. For a monotone graph property P let ~H(P) denote the 
smallest r for which the graph on this point set has the property P. Penrose showed 
that 

H(5(G) >k) = H (connectivity (G) > k) 

whp: that is, as soon as the graph has minimum degree k it is K-connected whp. 

He also considered the threshold for G to have a Hamilton cycle. Obviously a 
necessary condition is that the graph is 2-connected. In the normal (Erdos-Renyi) 
random graph this is also a sufficient condition in the following strong sense. If we 
add edges to the graph one at a time then the graph becomes Hamiltonian exactly 
when it becomes 2-connected (see [H],[9],[8] and [5]). 

Penrose, conjectured that the same is true for a random geometric graph. In this 
paper we prove the following theorem proving the conjecture. 

Theorem 1. Suppose that G = G n)T the two-dimensional Gilbert Model. Then 

7i(G is 2-connected) = Tt{G has a Hamilton cycle) 

whp. 

Combining this with the earlier result of Penrose we see that, if irr 2 = logn + 
log log n + q then the probability that G has a Hamilton cycle tends to e~ e a . 

Some partial progress has been made on this conjecture previously. Petit |13| 
showed that if irr 2 /logn tends to infinity then G is, whp, Hamiltonian, and Diaz, 
Mitsche and Perez [7] proved that if irr 2 > (1 + e)\ogn for some e > then G is 
Hamiltonian whp. (Obviously, G is not Hamiltonian if irr 2 < logn since whp G is 
not connected!) Finally in [1] Balogh, Kaul and Martin proved Penrose's conjecture 
for the special case of the norm in two dimensions. 

Our proof generalises to higher dimensions, and to other norms. The Gilbert 
Model makes sense with any norm and in number of dimensions: we let S% be the 
d-dimensional hypercube with volume n. We prove the analogue of Theorem Q] in 
this setting. 

Theorem 2. Suppose that the dimension d and \\-\\, a p-norm for some 1 < p < oo, 
are fixed. Let G = G nx be the resulting Gilbert Model. Then 

7i(G is 2-connected) = 7i{G has a Hamilton cycle) 
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whp. 

The proof is very similar to that of Theorem [TJ However, there are some signifi- 
cant extra technicalities. 

To give an idea why these occur consider connectivity in the Gilbert Model in 
the cube S% (with the Euclidean norm). Let A be the volume of a sphere of radius r. 
We count the expected number of isolated points in the process which are away from 
the boundary of the cube. The probability a point is isolated is e~ A so the expected 
number of such points is ne~ A , so the threshold for the existence of a central isolated 
point is about A = logn. 

However, consider the probability that a point near a face of the cube is isolated: 
there are approximately n 2//3 such points and the probability that they are isolated is 
about e~ A l 2 (since about half of the sphere about the point is outside the cube S 3 ). 
Hence, the expected number of such points is n 2 ^ 3 e~ A ^ 2 , so the threshold for the 
existence of an isolated point near a face is about A = ^ log n. In other words 
isolated points are much more likely near the boundary. These boundary effects are 
the reason for many of the extra technicalities. 

The /c-nearest neighbour model 

We also consider a second model for random geometric graphs: namely the /c-nearest 
neighbour graph. In this model the initial setup is the same as in the Gilbert model: 
the vertices are given by a Poisson process of density one in the square S n , but this 
time each vertex is joined to its k nearest neighbours (in the Euclidean metric) in the 
box. This naturally gives rise to a fc-regular directed graph, but we form a simple 
graph G = G n> k by ignoring the direction of all the edges. It is easily checked that 
this gives us a graph with degree between k and 6k. 

Xue and Kumar [15] showed that there are constants c\ , C2 such that if k < c\ log n 
then the graph G n ^ is, whp, not connected, and that if k > C2logn then G n & is, 
whp, connected. Balister, Bollobas, Sarkar and Walters pQ proved reasonably good 
bounds on the constants: namely c\ = 0.3043 and C2 = 0.5139, and later [3] proved 
that there is some critical constant c such that if k = d log n for d < c then the 
graph is disconnected whp, and if k = c'logn for d > c then it is connected whp. 
Moreover, in [2], they showed that in the latter case the graph is s-connected whp 
for any fixed s G N. 

We would like to prove a sharp result like the above: i.e., that as soon as the 
graph is 2-connected it has a Hamilton cycle. However, we prove only the weaker 
statement that some (finite) amount of connectivity is sufficient. Explicitly, we show 
the following. 
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Theorem 3. Suppose that k = k(n), that G = G n j. is the k-nearest neighbour graph, 
and that G is K-connected for k = 5 • 10 7 whp. Then G has a Hamilton cycle whp. 

2 Proof of Penrose's Conjecture 

We divide the proof into five parts: first we tile the square S n with small squares 
in a standard tessellation argument. Secondly we identify "difficult" subsquares. 
Roughly, these will be squares containing only a few points, or squares surrounded 
by squares containing only a few points. Thirdly we prove some lemmas about the 
structure of the difficult subsquares. In stage 4 we deal with the difficult subsquares. 
Finally we use the remaining easy subsquares to join everything together. 

Stage 1: Tessellation. 

Let ro = WQognf/TT (so ixr^ = logn), and let r be the random variable 
TL{G is 2-connected). Let s = r^/c = c'ydogn where c is a large constant to be 
chosen later (1000 will do). We tessellate the box S n with small squares of side 
length s. Whenever we talk about distances between squares we will always be re- 
ferring to the distance between their centres. Moreover, we will divide all distances 
between squares by s, so, for example, a square's four nearest neighbours all have 
distance one. 

By the result of Penrose [llj mentioned in the introduction we may assume that 
(1 — l/2c)ro < r < (1 + l/2c)ro: formally the collection of point sets which do not 
satisfy this has measure tending to zero as n tends to infinity, and we ignore this set. 

Hence points in squares at distance r ~^ s > r °~ 2s = (c — 2) are always joined 
and points in squares at distance r+v ^ s < r '°+ 2s = (c + 2) are never joined. 

Stage 2: The "difficult" subsquares. 

We call a square full if it contains at least M points for some M to be determined 
later (10 7 will do), and non-full otherwise. Let Nq be the set of non-full squares. We 
say two non-full squares are joined if their distance is at most Ac — 1 and define 
M to be the collection of non-full components. 

First we bound the size of the largest component of non-full squares. 

Lemma 4. For any M , the largest component of non-full squares in the above tes- 
selation has size at most 

U= rvr(c + 2) 2 ] 

whp. 
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Also, the largest component of non-full squares including a square within c of the 
boundary of S n has size at most U/2 whp. Finally, there is no non-full square within 
distance Uc of a corner whp. 

Proof. We shall make use of the following simple result: suppose that G is any graph 
with maximal degree A and v is a vertex in G. Then the number of connected subsets 
of size n of G containing v is at most eA n (see e.g., Problem 45 of [6]). 

Hence, the number of potential components of size U containing a particular 
square is at most (e(8c) 2 ) u so, since there are less than n squares, the total number 
of such potential components is at most n(e(8c) 2 ) u . The probability that a square 
is non-full is at most 2s 2M e~ s /Ml. Hence, the expected number of components of 
size at least U is at most 

n(2s™ e-\e^T) m v < a (j (log,.)" f^fl)" exp (- (c + % '° 8 " ) 

which tends to zero as n tends to infinity: i.e., whp, no such component exists. 

For the second part there are at most 4c^/n squares within distance c of the 
boundary of S n and the result follows as above. 

Finally, there are only 4U 2 c 2 squares within distance c of a corner. Since the 
probability that a square is non-full tends to zero we see that there is no such square 
whp. □ 

Note that this is true independently of M which is important since we will want to 
choose M depending on U. 

In the rest of the argument we shall assume that there is no non-full component 
of size greater than U, no non-full component of size U/2 within c of an edge and 
no non-full square within Uc of a corner. 

Between these components of non-full squares there are numerous full squares. 
To define this more precisely let G be the graph with vertex set the small squares, 
and where each square is joined to all others within (c — 2) of this square (in the 
Euclidean norm). The graph G \ Nq has one very large component A which we call 
the sea. We call the squares in this sea the easy squares and the remaining squares 
the difficult squares. 

The idea is that it is trivial to find a cycle visiting every point of the process 
in a square in the sea A, and that we can extend this cycle to a Hamilton cycle by 
adding each non-full component (and any full squares cut off by it) one at a time. 
However, it is easier to phrase the argument by starting with the difficult parts and 
then using the sea of full squares. 

Stage 3: The structure of the difficult subsquares. 
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Square in A 
Close Square 
Far square 

The squares Qi and Q2 
Path from far square to A 
Path through A from Q2 to Qi 



Figure 1: A small part of S n containing the non-full component N and the corre- 
sponding set A, far squares and close squares. It also shows the two vertex disjoint 
paths from the far squares to A and the path joining Q2 to Q\ (see stage 4). 



Consider one component N 6 N of the non-full squares, and suppose that it has 
size u. By Lemma 0] we know u < U. We will also consider N^c- the 2c-blow-up of 
N: that is the set of all squares with distance at most 2c from a square in N. 

Now some full squares may be cut off from the rest of the full squares by non-full 
squares in N. More precisely the graph G\N has one giant component A = A(N); we 
call A c the cutoff squares. Note that A is a superset of the sea A, since we have only 
removed one component of the non-full squares. We shall see later (Corollary [TUj) 
that the sea is DiVeA/" 

We split the cutoff squares into two classes: those with a neighbour in A (in 
G) which we think of as being "close" to A, and the rest, which we shall call far 
squares. All the close squares must be in N (since otherwise they would be part of 
A). However, we do not know anything about the far squares: they may be full or 
non-full. See Figure [1] for a picture. 

Lemma 5. No two far squares are more than distance c/10 apart. 

Remark: this does not say whp since we are assuming this non-full component 
has size at most U. 
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Proof. Suppose not. 

Suppose, first, that no point of N is within c of the edge of S n , and that the 
two far squares are at horizontal distance at least c/10. Then consider the left most 
far square. All squares which are to the left of this and with distance to this square 
less than (c — 2) must be close and thus in N. Similarly with the right most far 
square. Also at least (c — 2) squares (in fact nearly 2(c — 2)) in each of at least c/10 
columns between the orginal two far squares must be in N. This is a total of about 
7r(c — 2) 2 + (c — 2)c/10 > U which is a contradiction (provided we chose c reasonably 
large) . 

If there is a point of N within c of the boundary then the above argument gives 
more than U/2 non-full squares. Indeed, either it gives half of each part of the above 
construction, or it gives all of one end and all the side parts. This contradicts the 
second part of our assumption about the size of non-full components. 

We do not need to consider a component near two sides: it cannot be large 
enough to be near two sides. It also cannot go across a corner, since no square 
within distance Uc of a corner is non-full. □ 

This result can also be deduced from a result of Penrose as we do in the next 
section. We have the following instant corollary. 

Corollary 6. The graph G restricted to the far squares is complete. □ 

Corollary 7. The set T(A C ) of neighbours in G of the cutoff squares A c is contained 
in N2c- 

Proof. We prove that A c C N c the c-blow-up of N. Indeed, suppose not. Let x be 
a square in A c \ N c . First, x cannot be a neighbour of any square in A or x would 
also be in A; i.e., x is a far square. 

Now, let y be any square with distance c/5 from x. The square y cannot be 
in N since then x would be in N c . Therefore, y cannot be a neighbour of any square 
in A since then it would be in A and, since x and y are joined in G, x would be in 
A; i.e., y is also a far square. Hence, x and y are both far squares with distance 
c/5 which contradicts Lemma [5j 

□ 

In particular, Corollary [7] tells us that the sets of squares cutoff by different non- 
full components and all their neighbours are disjoint (obviously the 2c-blow-ups are 
disjoint). 

The final preparation we need is the following lemma. 
Lemma 8. The set N2 C H A is connected in G. 
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Since the proof will be using a standard graph theoretic result, it is convenient to 
define one more graph G\\ again the vertex set is the set of small squares, but this 
time each square is joined only to its four nearest neighbours: i.e., G\ is the ordinary 
square lattice. We need two quick definitions. First, for a set E 6 G\ we define the 
boundary d\E of E to be set of vertices in E c that are neighbours (in G±) of a vertex 
in E. Secondly, we say a set E in G\ is diagonally connected if it is connected when 
we add the edges between squares which are diagonally adjacent (i.e. at distance y/2) 
to G. The lemma we need is the following; since its proof is short we include it here 
for completeness. 

Lemma 9. Suppose that E is any subset of G\ with E and E c connected. Then d±E 
is diagonally connected: in particular, it is connected in G. 

Proof. Let F be the set of edges of G\ from E to E c , and let F 1 be the corresponding 
set of edges in the dual lattice. Consider the set F' as a subgraph of the dual lattice. 
It is easy to check that every vertex has even degree except vertices on the boundary 
of G±. Thus we can decompose F' into pieces each of which is either a cycle or a 
path starting and finishing at the edge of G\. Any such cycle splits G\ into two 
components, and we see that one of these must be exactly E and the other E c . Thus 
F' is a single component in the dual lattice, and it is easy to check that implies that 
d\E is diagonally connected. □ 

Proof of Lemma\^ Consider G\ \ N2 C - This splits into components B\, F>2, . . . , B m . 
By definition each B{ is connected. Moreover, each Bf is also connected. Indeed, 
suppose x, y £ Bf. Then there is an xy path in G\. If this is contained in Bf we are 
done. If not then it must meet A^2 C , but N2 C is connected. Hence we can take this 
path until it first meets N2 C go through N2 C to the point where the path last leaves 
N2c and follow the path on to y. This gives a path in Bf. 

Hence, by Lemma [9J we see that each d\Bi is connected in G for each i (where 
d\ denotes the boundary in G\). Obviously d\Bi C iV^c- 

As usual, for a set of vertices V let G[V] denote the graph G restricted to the 
vertices in V. 

Claim. Any two vertices in DfL^iBi are connected in G[Af] N2 C ]- 

Proof. Suppose not. Without loss of generality assume that, for some k < m, 
G[U^ =l diBi] is connected and that no other d\Bi is connected via a path to it. 
Pick x € B\ and y 6 B m . Both x and y are in A (since they are not in N2 C and 
A c C N 2c by Corollary [TJ). 

Hence there is a path from x to y in A. Consider the last time it leaves U^ =1 -Bj. 
The path then moves around in N2 C before entering some Bj with j > k. This gives 
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rise to a path in A n N 2c from a point in U^ =l d\Bi to a point in dBj, contradicting 
the choice of A;. □ 

We now complete the proof of Lemma El To avoid clutter we shall say that two 
points are joined if they are connected by a path. Suppose that x, y G An N 2c . Since 
A is connected there is a path in A from x to y. If the path is contained in A^2 C we 
are done. If not, consider the first time the path leaves N 2c . It must enter one of 
the Bi, crossing the boundary d\Bi. Hence x is joined to some w G d\Bi in An N 2c . 
Similarly, by considering the last time the path is not in N 2c we see that y is joined 
to some z G d\Bj for some j. However, since the claim showed that w and z are 
joined in A n N 2c , we see that x and y are joined in A n N 2c . □ 

Corollary 10. The set of sea squares A = P\ N£cN A(N). In particular, for any 
N G N we have AnN 2c = A{N) n N 2c . 

Proof. We prove that the set A 1 = DiVeA/" -^(-^0 ^ s connected in G. 

Given two squares x,y in A' pick a path in G from x to y. Now for each non-full 
component iV in turn do the following. If the path misses N 2c do nothing. Otherwise 
let w be the first point on the path in N 2c and z be the last point in N 2c . Replace 
the xy path by the path xw, any path wz in A(N) n N 2c and then the path zy. 

At each stage the modification ensured that the path now lies in A(N). Also, 
the only vertices added to the path are in N 2c which is disjoint from all the previous 
N' 2c , and thus from all previous sets A(N'). Hence, when we have done this for all 
non-full components the path lies in every A(N'), i.e., in A' . Hence, A' is connected. 
Since it contains A we must have A' = A. 

Finally, by CorollaryEl A(N') D N 2c for all N' / N, so Af]N 2c = A(N)nN 2c . □ 

Stage 4 : Dealing with the difficult sub squares. 

We deal with each non-full component N £ M in turn. Fix one such compo- 
nent N. 

Let us deal with the far squares first. There are three possibilities: the far squares 
contain no points at all, they contain one point in total, or they contain more than 
one point. In the first case, do nothing and proceed to the next part of the argument. 

In the second case, by the 2-connectivity of G, we can find two vertex disjoint 
paths from this single vertex to points in squares in A. In the third case pick two 
points in the far squares. Again by 2-connectivity we can find vertex disjoint paths 
from these two vertices to points in squares in A. 

Suppose that one of these paths meets A in square Q\ and the other in square 
Q 2 . Let Pi, P 2 be the squares containing the previous points on these paths. No two 
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points in squares at (Euclidean) distance (c + 2) are joined we see that Pi is within 
(c + 2) of Q\. Since Pi E" A we have that some square on a shortest P\Q\ path in 
G\ is in N and thus that Q\ £ A^ c - Similarly Q2 £ A^ c - Combining we see that 
both Qi and Q2 are in N2 C H A. By Lemma [H we know that iV2 C H ^4 is connected 
in G so we can find a path from Q\ to Q2 in N2 C n 4 in G. This "lifts" to a path 
in G going from the point in Q2 to a point in Q\ using at most one vertex in each 
subsquare on the way and never leaving Ni c . 

Place the path from the point in Q\ to the far vertex; round all points in the far 
region finishing back at the second chosen vertex and then to Q2 and then through 
the sea as above back to Q\. Since Q\ E A n A^c, by Corollary 1101 we have that 
Qi E A. Combining, we have a path starting and finishing in the same subsquare of 
the sea (i.e., Q\) containing all the vertices in the far region. 

Next we deal with the close squares: we deal with each close square P in turn. 
Since P is a close square we can pick Q E A with PQ joined in G. 

If the square P has no point in it we ignore it. If it has one point in it then join 
that point to two points in Q. 

If it has two or more points in it then pick two of them x,y: and pick two points 
uv in Q. Place the path formed by the edge ux round all the remaining unused 
vertices in the cutoff square finishing at y and back to the square Q with the edge 
yv in the cycle we are constructing. 

The square Q is a neighbour of P E A c so, by Corollary [7] is in A^c- Since Q is 
also in A we see, by Corollary [10] as above, that Q E A. 

When we have completed this construction we have placed every vertex in a 
cutoff square on one of a collection of paths each of which starts and finishes at the 
same square in the sea (although different paths may start and finish in different 
squares in the sea). 

We use at most 2 £7 + 2 vertices from any square in A = A(N) when doing this so, 
provided that M > 2U + 2 + (2c + l) 2 , there are at least (2c + l) 2 unused vertices in 
each square of A when we finish this. Moreover, obviously the only squares touched 
by this construction are in N2 C and for distinct non-full components these are all 
disjoint. Hence, when we have done this for every non-full component N E M there 
are at least (2c + l) 2 unused vertices in each square of the sea A. 

Stage 5: Using the subsquares in the sea to join everything together. 

It just remains to string everything together. This is easy. Take a spanning tree 
for A, the sea of squares. By doubling each edge we can think of this as a cycle, as in 
Figure [2j This cycle visits each square at most (2c + l) 2 times. (In fact, by choosing 
a spanning tree such that the sum of the edge lengths is minimal we could assume 
that it visits each vertex at most six times but we do not need this.) Convert this 
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Figure 2: A tree of subsquares and its corresponding tree cycle. 

into a Hamilton cycle as follows. Start at an unused vertex in a square of the sea. 
Move to any (unused) vertex in the next square in the tree cycle. Then, if this is 
the last time the tree cycle visits this square visit all remaining vertices and join in 
all the paths constructed in the first part of the argument, then leave to the next 
square in the tree cycle. If it is not the last time the tree cycle visits this square 
then move to any unused vertex in the next square in the tree cycle. Repeat until we 
complete the tree cycle. Then join in any unused vertices and paths to this square 
constructed earlier before closing the cycle. □ 

3 Higher Dimensions 

We generalise the proof in the previous section to higher dimensions. Much of the 
argument is the same, in particular, essentially all of stages four and five. We include 
details of all differences but refer the reader to the previous section where the proof 
is identical. 

Stage 1: Tessellation. 

We work in the (i-dimensional hypercube of volume n (for simplicity we will 
abbreviate hypercube to cube in the following). As mentioned in the introduction, we 
no longer have a nice formula for the critical radius: the boundary effects dominate. 

Instead, we consider the expected number of isolated vertices E = E(r). We need 
a little notation: let A r denote the set {x : d(x, A) < r} and | • | denote Lebesgue 
measure. 
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We have E = ( a d exp(—\{x} r \)dx. Let 7*0 = 7*o(n) be such that E(ro) = 1. As 
before fix c a large constant to be determined later, and let s = r$/c. It is easy to 
see that r d = 0(logn) and s d = 0(logn). We tile the cube S d with small cubes of 
side length s. 

As before, let r = 7i(G is 2-connected). By Penrose [llj.[12j the probability that 
r £ [7*o(l — l/2c), 7*o(l + l/2c)] tends to zero and we ignore all these points sets. This 
time any two points in cubes at distance r ~ s ^ > r Q~ ds = ( c _ ^ are joined, and no 
points in cubes at distance r+s ^ < r °+ ds = ( c + d) are joined. 
S'iage T/ie "difficult" subcubes. 

Exactly as before we define non-full cubes to be those containing at most M 
points, and we say two are joined if they have distance at most 4c — 1. 

We wish to prove a version of Lemma HI However, we have several possible 
boundaries: for example, in three dimensions we have the centre, the faces, the 
edges and the corners. We call a non-full component containing a cube Q bad if it 
consists of at least (1 + l/c)\Q ro \/s d cubes. (Note a component can be bad for some 
cubes and not others). 

Lemma 11. The expected number of bad components tends to zero as n tends to 
infinity. In particular there are no bad components whp. 

Proof. The number of components of size U containing a particular cube is at most 
(e{8c) d ) u . The probability that a cube is non- full is at most 2s dM e~ s /Ml. 

Since min{|Q ro | : cubes Q} = 0(logn) and s d = 0(logn), the expected number 
of bad components is at most 

]T (2 S dM e - sd (e(8c) d )/M!)( 1+1 / c )l^ol/^ 

cubes Q 

= (2s dM (e(8c) d )/M!)( 1+1 / c )l Q ''ol/^exp(-(l + l/c)|Q ro |) 

cubes Q 

= o(l) exp(-|Q ro |) 

cubes Q 

< o(l) / exp(-|{a"} ro |)cfa; 

= o(l)E(r ) 
= o(l). 



□ 
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(Again, note that this is true independently of M.) 

From now on we assume that there is no bad component. 

Stage 3: The structure of the difficult subcubes. 

In this stage we will need one extra geometric result of Penrose, a case of Propo- 
sition 2.1 of [11] or Proposition 5.15 of [12j . 

Proposition 12. Suppose d is fixed and that \\ ■ \\ is a p-norm for some 1 < p < oo. 
Then there exists r\ > such that if F C O d ( the positive octant in M. d ) with £oo 
diameter at least r/10 and x is the point of F with the smallest l\ norm; then 
\F r \ > \F\ + \{x} r \ + i]r d . 

We begin this stage by proving Lemma [5] for this model. 

Lemma 13. No two far cubes are more than £oo distance c/10 apart. 

Proof. Suppose not. Then let F be the set of far cubes, let x be the point of F 
closest to a corner in the l\ norm and let Q be the cube containing x. We know that 
all the cubes within (c — d) of a far cube are not in A. Hence all such cubes which 
are not far must be close, and thus non-full. 
The number of close cubes is at least 

\F {c -2d)s\F\ \{x} {c ^d)s\+v((c-2d)s) d 

— ^ > by Proposition 112] 

s a s a 

. IQ(c-3d)sl+^o/2 . . 

> -j provided c is large enough 

s a 

|Q(l-3d/c)r l +yr$/2 

- s d 



> {l-U/c) d \Q n) \+ W d /2 

— s d 

(l + l/c)|Q ro | . 

> j , provided c is large enough. 

s a 

This shows that the component is bad which is a contradiction. □ 

Corollaries [6] and [7] hold exactly as before. Lemma [8] also holds, we just need to 
replace Lemma [9] by the following higher dimensional analogue. Note that, even in 
higher dimensions we say two squares are diagonally connected if their centres have 
distance y/2. 

Lemma 14. Suppose that E is any subset of G\ with E and E c connected. Then 
d\E is diagonally connected: in particular, it is connected in G. 
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Proof. Let I be a (diagonally connected) component of d\E. We aim to show the 
I = d\E and, thus, that d\E is diagonally connected. 

Claim. Suppose that C is any circuit in G\ . Then the number of edges of C with 
one end in E and the other end in I is even. 

Proof of claim. We say that a circuit is contractible to a single point using the 
following operations. First, we can remove an out and back edge. Secondly, we 
can do the following two dimensional move. Suppose that two consecutive edges 
of the circuit form two sides of a square; then we can replace them by the other 
two sides of the square keeping the rest of the circuit the same. For example, 
we can replace (x,y + 1, z) — ► (x + l,y + 1, z) — > (x + l,y, z) in the circuit by 
(x,y+ l,z) -> (x,y,z) -> (x + l,y,z). 

Next we show that C is contractible. Let w(C) denote the weight of the circuit: 
that is the sum of all the coordinates of all the vertices in C. We show that, if C 
is non-trivial, we can apply one of the above operations and reduce w. Indeed, let 
v be a vertex on C with maximal coordinate sum, and suppose that V- and v+ are 
the vertices before and after v on the circuit. If V- = t>+ then we can apply the first 
operation removing v and v + from the circuit which obviously reduces w. If not, 
then both V— and v + have strictly smaller coordinate sums than v and we can apply 
the second operation reducing w by two. We repeat the above until we reach the 
trivial circuit. 

Now, let J be the number of edges of C with an end in each of E and I. The first 
operation obviously does not change the parity of J. A simple finite check yields 
the same for the second operation. Indeed, assume that we are changing the path 
from (x, y + 1), (x + l,y + 1), (x + 1, y) to (x, y + 1), (x, y), (x + 1, y). Let F be 
the set of these four vertices. If no vertex of / is in F then obviously J does not 
change. If there is a vertex of I in F then, by the definition of diagonally connected, 
F n / = F n d\E. Hence the parity of J does not change. (It is even if (x, y + 1) and 
(x + 1, y) are both in E or both in E c and odd otherwise.) □ 

Suppose that there is some vertex v G d\E\I and that u G E is a neighbour of 
v. Let y £ I and x G E be neighbours. Since E and E° are connected we can find 
paths P xu and P vy in E and E c respectively. The circuit P xu ,uv, P vy ,yx contains a 
single edge from E to I which contradicts the claim. □ 

To complete this stage observe that Corollary [10] holds as before. 

Stage 4-' Dealing with the difficult subcubes, and Stage 5: Using the subcubes in the 
sea to join everything together. 
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These two stages go through exactly as before (with one trivial change: replace 
(2c + l) 2 by (2c + l) d ). This completes the proof of Theorem [2 □ 

4 Proof of Theorem [3] 

In this section we prove Theorem [31 Once again, the proof is very similar to that in 
Section [2j We shall outline the key differences, and emphasise why we are only able 
to prove the weaker version of the result. 

Stage 1: Tessellation. 

The tessellation is similar to before, but this time some edges may be much longer 
than some non-edges. 

Let k = 7i(G is K-connected) be the smallest k that n ^ is K-connected. Since G 
is connected we may assume that 0.3 log n < k < 0.52 log n (see PQ). Let r_ be such 
that any two points at distance r_ are joined whp: e.g., Lemma 8 of [T] implies that 
this is true provided 7rr 2 < 0.3e~ 1-1 / ' 3 logn, so we can take r_ = 0.035v / Tog~n. 

Let r + be such that no edge in the graph has length more than r + . Then, again 
by [I], we have 

vrr 2 < 4e(l + 0.52) < 17 

whp, so we can take r + = 2.3y / log n < 66r_. 

Prom here on, we ignore all point sets with an edge longer than r + or a non-edge 
shorter than r_. 

Let s = T—/y/%. We tessellate the box S n with small squares of side length s. 
(Since we are proving only this weaker result our tesselation does not need to be 
very fine.) By the choice of s and the bound on r_ any two points in neighbouring 
or diagonally neighbouring squares are joined in G. Also, by the bound on r + no 
two points in squares with centres at distance more than (66\/5 + 2)s < 150s are 
joined. Let D = 10 4 ; we have that no two points in squares with centres distance 
Ds apart are joined. 

Stage 2: The "difficult" subsquares. 

We call a square full if it contains at least M = 10 9 points and non-full otherwise. 
We say two non-full squares are joined if they are at £oo distance at most 2D — 1. 
First we bound the size of the largest component of non-full squares. 

Lemma 15. The largest component of non-full squares has size less than 7000 whp. 

Proof. The number of components of size 7000 containing a particular square is at 
most (e(4D) 2 ) 7000 so, since there are less than n squares, the total number of such 
components is at most n(e(4D) 2 ) 7000 . The probability that a square is non-full is 
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Figure 3: Two paths from one cutoff square to the sea together with the path from 
the meeting point in Q2 to the square Q±. 

at most 2s 2M e~ s2 /M !. Hence, the expected number of components of size at least 
7000 is at most 

n(2s 2M e- s2 (e(4D) 2 )/M\) 7000 

( n /(0.035) 2 logn\ M e(4L») 2 V° 00 ( -7000(0.035) 2 log n 
^ n [ 2 { 5 ) -MT) 6XP 

which tends to zero as n tends to infinity (since 7000(— 0.035) 2 /8 > 1.07 > 1): i.e., 
whp, no such component exists. □ 

In the rest of the argument we shall assume that there is no non-full component 
of size greater than 7000. 

Stage 3: The structure of the difficult subsquares. 

As usual we fix one component N of the non-full squares, and suppose that it has 
size u (so we know u < 7000). This time we define G to be the graph on the small 
squares where each square is joined to its eight nearest neighbours (i.e., adjacent 
and diagonal). Let A be the giant component of G \ N, and again split the cutoff 
squares into close and far depending whether they have a neighbour (in G) in A. 

By the vertex isoperimetric inequality in the square there are at most u 2 /2 
squares in A c \ N so |yl c | < u 2 /2 + u < 2.5 • 10 7 . 

Next we prove a result similar to Corollary [71 



Lemma 16. The set of cutoff squares A c is in Nd (where D = 10 4 as above). 
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Proof. Suppose not, and that Q is a square in A c not in Nd- Then all squares within 
£oc distance of Q at most D are not in N. Hence they must be in A° (since otherwise 
there would be a path from Q to a square in A not going through any square in N). 
Hence A c > D 2 = 10 8 which contradicts Lemma [T5l □ 

Finally, we need the analogue of Lemma [8] whose proof is exactly the same. 
Lemma 17. The set Nd n A is connected in G. □ 

Stage 4 : Dealing with the difficult sub squares. 

Let us deal with these cutoff squares now. From each cutoff square that contains 
at least two vertices pick any 2 vertices and from each cutoff square that contains a 
single vertex pick that vertex with multiplicity two. We have picked at most 5 • 10 7 
vertices, so since G is k = 5- 10 7 connected we can simultaneously find vertex disjoint 
paths from each of our picked vertices to vertices in squares in A (two paths from 
those vertices that are repeated). 

We remark that these are not just single edges: these paths may go through other 
cutoff squares. 

Call the first point of such a path which is in the sea a meeting point, and the 
square containing it a meeting or terminating square. 

Each cutoff square has two meeting points say in subsquares Qx,Q2- Since the 
longest edge is at most r + both Q\ and Q2 are in Nd- Since A n Nd is connected 
in G we construct a path in the squares in A n Nd from the meeting point in Q2 to 
a vertex in Q± using at most one vertex in each subsquare on the way, and missing 
all the other meeting points. This is possible since each full square contains at least 
M = 10 9 vertices. 

Place the path from the terminating point in Q% to the cutoff square, round all 
the vertices in the cutoff square not used in the rest of the construction, followed by 
the path back to the meeting point in Q2 and then through the sea as above back 
to Q\. We have a path starting and finishing in the same subsquare (Qi) containing 
all the (unused) vertices in the cutoff square. 

Do this for every cutoff square. When doing this, in each square in the sea we 
use at most two vertices for each cutoff square. Moreover, obviously only squares in 
Nd are touched by this construction. Since non-full squares in distinct components 
are at distance at least 2D the squares touched by different non-full components are 
distinct. Thus in total we have used at most 4 • 10 vertices in any square in the 
sea, and since M = 10 9 there are many (we shall only need 8) unused vertices left 
in each full square in the sea. 

Stage 5: Using the subsquares in the sea to join everything together. 
This is exactly the same as before. 
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5 Comments on the /c-nearest neighbour proof 

We start by giving some reasons why the proof in the /c-nearest neighbour model only 
yields the weaker Theorem [3j The first superficial problem is that we use squares in 
the tesselation which are of "large" size rather than relatively small as in the proof 
of Theorem [TJ (in other words we did not introduce the constant c when setting s 
depending on r). 

Obviously we could have introduced this constant. The difficulty when trying 
to mimic the proof of Theorem Q] is the large difference between r_ and r + , which 
corresponds to having a very large number of squares (many times 7rc 2 ) in our non- 
full component N. This means that we cannot easily prove anything similar to 
Lemma [5j Indeed, a priori, we could have two far squares with ire 2 non-full squares 
around each of them. 

A different way of viewing this difficulty is that, in the /c-nearest neighbour model, 
the graph G on the small squares does not approximate the real graph G very well, 
whereas in the Gilbert Model it is a good approximation. Thus, it is not surprising 
that we only prove a weaker result. 

This is typical of results about the A;-nearest neighbour model: the results tend 
to be weaker than for the Gilbert Model. This is primarily because the obstructions 
tend to be more complex: for example, the obstruction for connectivity in the Gilbert 
Model is the existence of an isolated vertex. Obviously in the /c-nearest neighbour 
model we never have an isolated vertex; the obstruction must have at least k + 1 
vertices. 

Extensions of Theorem [3] 

When proving Theorem [3] we only used two facts about the random geometric graph. 
First, that any two points at distance r_ = 0.035ydl3gn are joined whp. Secondly, 
that the ratio of r + (the longest edge) to r_ (the shortest non-edge) was at most 
60 whp. Obviously, we could prove the theorem (with different constants) in any 
graph with r_ = ©(ydog n) and r + /r„ bounded. This includes higher dimensions 
and different norms and to different shaped regions instead of S n (e.g. to disks or 
tor uses). Indeed, the only place we used the norm was in obtaining the bounds on 
r + and r_ in stage 1 of the proof. 

Indeed, it also generalises to irregular distributions of vertices provided that the 
above bounds on r_ and r+ hold. For example, it holds in the square S n where the 
density of points in the Poisson Process decrease linearly from 10 to 1 across the 
square. 
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6 Closing Remarks and Open Questions 

A related model where the result does not seem to follow easily from our methods 
is the directed version of the fc-nearest neighbour graph. As mentioned above, the 
/c-nearest neighbour model naturally gives rise to a directed graph and we can ask 
whether this has a directed Hamilton cycle. Note that this directed model is sig- 
nificantly different from the undirected: for example it is likely (see [T]) that the 
obstruction to directed connectivity (i.e., the existence of a directed path between 
any two vertices) is a single vertex with in-degree zero; obviously this cannot occur in 
the undirected case where every vertex has degree at least k. In some other random 
graph models a sufficient condition for the existence of a Hamilton cycle (whp) is 
that there are no vertices of in-degree or out-degree zero. Of course, in the directed 
/c-nearest neighbour model every vertex has out-degree k so we ask the following 
question. 

Question. Let G = G n ^ be the directed k-nearest neighbour model. Is 

7i(G has a Hamilton cycle) = 7i{G has no vertex of in-degree zero) 

whp? 

It is obvious that the bound on connectivity in the fc-nearest neighbour model 
can be improved, but the key question is "should it be two?" We make the following 
natural conjecture: 

Conjecture. Suppose that k = k(n) such that the k-nearest neighbour graph G = 
G(k, n) is a 2-connected whp. Then, whp, G has a Hamilton cycle. 
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